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Abstract 

Wigner’s marginal probability theory is revisited, and systematically applied to n-particle corre¬ 
lation measurements. A set of Bell inequalities whose corollaries are Hardy contradiction and its 
generalisation are derived with intuitive graphical analysis. 
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1. Introduction 


An unambiguous signal of the departure from local causality in quantum mechanics is ex¬ 
pressed as violation of Bell inequality flj] and its variants, the Bell theorems, such as CHSH 
inequalities j2l, GHZ theorem [0] and Hardy’s arguments flEll , which has been verified by the 
experiments by use of various physical systems II Hi 01 lOr . The relations among these Bell 
theorems have been uncovered gradually EM- The violation of Bell inequality can be viewed 
also as a sign of the breakdown of the classical concepts of joint probability. As independently 
pointed out by Wigner and Fine ijjl 13 03], the Bell inequality, in its original form devised by 
Bell, presupposes underlying joint probability distribution for all possible measurement setups 
and outcomes in the experiment, whose marginal probabilities yield the probability distributions 
of the actual measurement setup of each run. 

In this article, we revisit the argument by Wigner and Fine (TT, 141, 151. and try to make clear 
the relation between their arguments and the concept of local reality. In the process, we uncover 
the graphic structure of the underlying joint probability, which we utilise to systematically to 
search general form of Hardy’s equalities and associated inequalities. 

This article is organized as follows. In Sec. 2, we introduce the concepts of the underlying 
probabilities which respects statistical independence and yield the measurable probability distri¬ 
butions as their marginal probabilities. In Sec. 3, we check consistency between the assumptions 
of statistical independence and the local reality. In Sec. 4, we turn to the graphical representa¬ 
tion of the marginal probability for two-by-two experiment. Using this representation, we clarify 
that Hardy’s equalities can be thought of as an extremal case of CHSH inequality. In Sec. 5, 
we generalise the above observation for multi-setting case and multi-partite case. Sec. 5 has the 
discussion and our conclusion. 
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2. Marginal and Underlying Probabilities, Statistical Independence of Measurements 

Consider a system of two 1/2-spins, each of which are measured separately by two observers 
Alice and Bob, both of whom can set up the measurement devices along two different axes, a — 
0,1 for Alice , and b = 0,1 for Bob. We assume that the result of the dichotomic measurements 
is specified by A = 0,1 for Alice, and B - 0, 1 for Bob. The usual sign notations of projection 
are recovered by identifications sa = (-) A and sb = (-) 1 ’■ 

Let us denote the probability of Alice finding the system in the state A along the a-axis and 
Bob in B along b -axis by P ab (A, B). The assumption of statistical independence is expressed as 

P ab (A,B) = P a (A)P b (B). (1) 

Let us now consider parallel measurements on an ensemble of the system in two combinations of 
axes pair (a, b) and ( ab’). We write the joint probability of finding the results (A, B) and (A', B'j 
as W([A, B] ab , |A', /!']„-//). If the parallel measurements are statistically independent, we should 
have 


W([A, B ] ab ; [A', B'W) = P ab (A, B)P a , b ,(A’, B'), (2) 

From the assumption Q. we have 

A)o(A„, Bo)P n(Ai, B\) = P oi(A 0 , B,)P 10 (A], B 0 ) (3) 

since both are given by the product P^{Aq)P\{A\)Pq{Bq)P\{Bi). This immediately leads to the 
equivalence between W([Ao, Bolooi [Ai, Z?i]n) and W([Ao, Bijoi; [Ai, Bolio)- This means that we 
can define unconditional underlying probability p(Ao, Bo; A\, B\) which is the joint probability 
of Alice finding her system in state Aq along 0-axis, A i along 1-axis and Bob finding his particle 
in state By along 0-axis, B\ along 1-axis; 

P{Aq,A\\Bq,B{) = W([A 0 ,B 0 ]oo; [A^Bijn) = W([A 0 , Bi]oi; [Ai,B 0 ]io), (4) 

which signify the “reality” of the set of physical observables (Ao, /iy; A\, B\) irrespective to the 
measurement. There are 2 4 = 16 of Us. As probabilities, they are all non-negative real numbers, 
which add up to unity. The observable probabilities / J , ; are obtained from p by partial sum as 

Foo(Ao,B 0 ) = ^ p(Ao,B 0 ;Ai,Bi), Bi 0 (Ai,B 0 )= ^ p(A 0 ,By;Ai,Bi), 

A\,B\ Aq,B\ 

B 0 i(A 0 ,B 1 )= ^pCAo.BojAi.B,), Bn/A^BO = ^ P(Ao,B 0 ;A^Bi). (5) 

Ai,5 0 Aq,Bq 

Following Wigner, we call the probability P ab s that are obtainable from direct measurements 
as marginal probabilities. Although the existence of unconditional probability p which is guar¬ 
anteed by the relations (0J), or equivalently ([3}. are derivable from the statistical independence 
between Alice’s and Bob’s observables, 0. the former is a looser assumption, from which the 
latter may not necessarily follow. So irrespective to the logic we have followed until now, we 
shall use the existence of unconditional probability U expressed in relations 0 and 0 as the 
basic assumption. 
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3. Derivation of Joint Probability Based on Locally Realistic Theory with Hidden Variable 

Although in the derivation of p(Ao, Bq\A\, B\ ), the cornerstone of our approach, we have pre¬ 
tended that it is derived from the statistical independence assumption of marginal probilities, it 
can be derived by quite deferent set of assumption based on the concept of hidden variable the¬ 
ory with local realism. Of course, this is how the Bell inequality has been historically derived, 
and this is exactly the reason that Bell theorem is termed one of the most profound theorem in 
phyisics. We need, however to put it in perspective in view of the fact that there are vocal dis¬ 
senting view El on the significance of Bell experiment that what is proven is just the statistical 
separability of probabilities, not the negation of local realism. 

Here, we detail how Wigner has arrived at conditional and unconditional underlying proba¬ 
bilities W and p, stating from deterministic theory with hidden variables. Although we base our 
argument on the specific case of two-by-two Bell experiment, readers will see its generality and 
applicability to broader situations. We first want to construct a framework of deterministic the¬ 
ory with hidden variables, that are, unknown variables which are not to be observed directly, but 
whose ensemble average generates all 2 4 = 16 marginal probabilities P a b(A, B), that completely 
specifies the outcomes of two-by-two Bell experiment. 

It is, in fact, rather easy to have such a theory, given sufficient number of variables are brought 
in. Consider four variables q a b with a - 0, 1 and b = 0,1. we assume q a b takes four values 0,1,2 
and 3, which we express as 


d_ab ~ A a b + 2 B a b ( 6 ) 

with binary variables A a b - 0, 1 and B a i, = 0,1. We can alternatively think a byte variable A 
made up of eight bits {A a b, B a b }, namely 

A = {Aoo,Boo,Aio,Bio,Aoi,B()i,Aii,Bii} (7) 

as our hidden variables. These variables are governed by some unspecified dynamics. It is just 
sufficient if we accept that the result of the projection measurement with axes a for Alice and b 
for Bob is determined by the value of q a b in such way that the projections sa and sg for Alice 
and Bob respectively are given by sa = (-) Aab and sb — {-) Bab . 

We now consider an ensemble of this deterministic system whose density of distribution on 
the space of A is given by 

W(A) = W(A 00 , Boo; A 10 ,B 10 ; A 0 i,B 0 i; An,Bn) (8) 

The observable probability P a b are given by 

^oo(A, B) - EEZ W(A,B; Aio, Bio; Aoi, Boil An, Bn), 

Aio,#io Aoi,#oi An.#n 

P to(A, B) = £ Z Z W(A i)0 ,B m -,A,B-,A 0u B m -,A n ,B n ), etc., (9) 

A()0.#00 A()l,#01 An,#11 

and we should always be able to construct a deterministic theory whose initial ensemble of vari¬ 
able A (t = 0), that corresponds to the initial setup of the experiment, evolve into the distribution 
W(A) with A = A(f) at the time of the measurement, that reproduces the observed P a h(A, B). It 
has to be remarked that the existence of this supposed underlying theory itself is rather unremark¬ 
able. The fact, that it requires an eight-bits variables capable of 2 8 = 256 values to reproduce 16 
experimental numbers, makes this theory an example of “cost-ineffective” generalization. 
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Now, we consider only those deterministic theories that respect local realism a la J.S.Bell. 
Then, the result of the Alice’s measurement should not depend on how Boh’s measurement device 
is placed, and Bob’s result should not depend on how Alice’s measurement device is placed. The 
values of the variables, in this type of theory, have to be limited in such way to reflect this fact, 
namely 


Aoo = A 0 i = Aq, Aio=An=Ai, Boo-B w = B 0 , B 0 i = B n = B\. (10) 

So half of the bits in the variable A are redundant. In other word, local realistic theories underly¬ 
ing the observable P a b(A, B) is to be described by a half byte variable A, which we define in the 
form 


A = {A 0 ,B 0 ,A l ,B l }. (11) 

The ensemble-averaged system we observe in experiments is to be specified by the density of 
distribution 


p(T) =p(A 0 ,B 0 ;A 1( B 1 ), (12) 

which gives the observable probability P a b in the form 

Poo(A,B)= ^ p(A, B;Ai, Bi), P W (A, B) = p(A 0 , B;A, B,), etc.. (13) 

A\,B\ Aq,B\ 

Note that the variable A can take 16 discrete values, as opposed to 256 for A. Consequently, 
there are 16 of p(A) functions as in contrast to 256 of W( A) functions. 

In formal term, the local realistic subclass of hidden variable theory is obtained from all 
possible theory by the reduction of density distribution 

W(A 00 ,Boo;Aio,Bio;Am,Z?oi;A ii,Bh) = p(Aoo,Bio;An,Boi)dA 0 o,Aoi^Aoi,A 1 i5B 00 ,Bio^Boi.Bii> (14) 
or conversely, 

p(A 0 ,B 0 ;A 1 ,B 1 ) = WlA^B^AuB^A^Bp.AuB,). (15) 

We see that the “hidden variables” A a b , B n b and their reductions A a , B/, are indeed hidden 
behind their guise as projection indices in W and U. 

4. Diagrammatical Proof of Bell inequalities and Hardy Contradiction 

The assumption of the existence of underlying unconditional probability p(Ao,Bo;Ai,Bi) 
leads to several relations among marginal probabilities P a b(A, B). 

Let us first reconfirm our notational conventions. The index a, that takes two values 0 and 1, 
stands for the axis of choice by Alice, along which the first particle is projectively measured to 
yield the value sa = + or -, which is alternatively expressed as A = 0 or 1. Similarly, the index 
b, that can be 0 or 1, stands for the axis of Bob’s measurement, which yields the value Sb = + or 
- that is alternatively expressed as B = 0 or 1. We define inter-particle axis specific projections 


i - Aq + 2Bq, 


j — A i + 2B[. 
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(16) 


The value of i represents the combined projection along the “0” axes of Alice and Bob, and j 
along their “1”- axes. We now reorder p{A$,A\\Bq,B{) by the indices i and j with the use of a 
matrix V defined by 


Vij - p(Ao,Bo;Ai,Bi)|(/ = a 0 +2Boj=Ai+2Bi)- (17) 

The partial sum <[5j then becomes 

3 1 

P n (A, B) = ^ V a+2 bj Pio(A, B ) = z ( V2B+i,A + VlB+iMl) » 

7=0 i=0 

1 3 

P oi(A, B) = Y j (Va,2 B+ 7 + V A+ 2,2B +J ), Pn(A, B ) = Z V,A+2fi (18) 

7=0 ( =0 

These expression has a very intuitive graphical representation Fig. 1. If we place p on a 4-by-4 
grid according to the indices i and j, Poo are given by sums along horizontal lines. Pi i by sums 
along vertical lines, while P \o and Pq\ are given by sums ribbon-shaped lines vertically and 
horizontally placed. 



Figure 1: Marginal probabilities P a b(A, B ) represented as a line or linked lines which cover the the grids that represent 
the underlying probabilities Vjj to be summed up. 


From the compositions (fl8l >. it follows, for example, that 


Pio(0,0) + P 0 i(0,0) + F n (l, 1) = 2V 0 ,o + Vo.i + Vo ,2 + V 0>3 

+Vi,o + Vi,2 + V13 

+ V27) + V24 + V 2,3 

+ V 3 ,3 


which contains 


09) 


Foo(0,0) - Vo,o + Vo.i + Vo : 2 + Vo, 3 

and some more positive quantities. Thus we have a Bell inequality in the form 


( 20 ) 


P to(0,0) + P 0 i(0,0) + P n(l, 1) - Foo(0,0) > 0 (21) 

A graphical representation of this inequality is shown in Fig. 2. 

A corollary immediately obtained is that 

if Fio(0,0) = Poi(0,0) = Pn(l, 1) = 0 hold, then Poo(0,0) = 0 follows. 
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( 22 ) 









o o o 

P 01 (0,0)+P 10 (0,0) 

+P n (l,l)>P QO (0,0) 

Figure 2: Graphical depiction of a Bell inequality among marginal probabilities P a b(A, B) represented as a line or linked 
lines for terms in LHS, a dashed box for the RHS, superimposed on the grid representing the underlying probabilities 
Vij to be summed up. 

This is nothing but the equality obtained by Lucien Hardy. A similar construction to (fl9l . (l20l > 
leads to 


P 10 (l, 1) + Poi(l, 1) + B n (0,0) - Poo(l, 1) > 0, 

Pw( 1,0) + Poi(l.O) + P n(0,1) - Poo(l.O) > 0, (23) 

Pi o(0,1) + Poi(0,1) + Pu 0,0) - Poo(0,1) > 0. 

With the definition of corelation C a b, 


C ab = Pabi 0,0) - P ab ( 1,0) - P ab (0, 1) + Pab{ 1, 1) (24) 

inequalities (OH and (l23b lead to 


|Cio + C 01 + Ci, - Cool ^ 2, (25) 

which is a celebrated CHSH inequality. 

With the interchanges of axis indices a and b, and with interchanges of projections 0 and 
1 for both A and B in P a b(A, B), there are 64 inequalities of the type (PT1 ). Each four of them 
sharing the same leg ab form a single CHSH inequality. As an obvious corollary, there are 64 
variants of Hardy equality. 

5. Three Particle Generalization and More 

The extension to n-particle case is rather straightforward. We illustrate it with three particle 
case. Let us now consider a system consisting of three spin 1/2 particles that are respectively 
measured by Alice, Bob and Chris, all of whom have two choices each for the orientation of 
measurement represented by a = 0 or 1 , b = 0 or 1 and c = 0 or 1, respectively. Alice’s 
measured projection is represented by A a — 0 or 1, Bob’s by B/, - 0 or 1 and Chris’ by C c = 0 or 
1. If there is an underlying probability p(Ao, Bo, Co; A \ , B i, C\ ), that presuppose the simultaneous 
existence of all projections for all three observers, the marginal probabilities P a b c s which are the 
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Figure 3: Graphical depiction of three-body marginal probabilities P a b(A, B,C) represented as a line or linked lines 
superimposed on the grids representing underlying probability Vjj to be summed up. 


joint probabilities of direct observables are given by the partial sums 

Pooo(A,B,C)= Y P(A,B,C-,A',B',C), P 100 (A,B,C) = Y PfA.B,C-,A',B',C), 

Poio(A,B,C)= Y P( A ’ B ’C\A',B',C), Puo(A,B,C)= Y P(. A ’ B ’C;A',B',C'), 

A' ,B',C' A',B',C' 

P 00 i(A,P,C) = Y P( A ’ B ’C\ a ',B',C'), P 10 i(A, P, C) = Y P(A,P,C;A',P',C'), 

A',B',C' A',B',C' 

Pon (A, B, C) = Y P(A,P,C;A',P',C'), Pm(A,P,C) = Y P(A,P,C;A',P',C'). ( 26) 

A',B',C' A',B',C' 

This set of construction may or may not result in the assumption of statistical independence 

P abc (A , B, C ) = P a (A)P b (B)P c (C), (27) 

although the latter necessarily results in the former. In case (1771) holds, we have 

Pooo(A,P,C)Pih(A',P',C') - P 00 i(A, P, C')Pno(A', P', C), 

Pooo(A, B, C)Piio(A', P', C) = Pi 00 (A', B, C)P 010 (A, P', C'), 

Pm (A, B, C)P ooi(A', P', C') = P 0 n(A', B, C)P 10 i(A, B', C'), (28) 

etc.. With the three-digits grouping of indices 

Vij = p(A 0 , B o, Q; Ai, Pi, Ci)|(/ = a ij +2Bo+4CoJ=A 1 +2B 1 +4C 1 ) 5 
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( 29 ) 










the partial sums (l26l > become 


' i,A+2B+4C » 


Pooo(A,B,C ) - ^ Va+2b+4cj, Pu\(A,B,C) - z V 

3= o i= o 

3 3 

Pno(A, B, C) {V$c+iA+2B + V 4 C+ 1 A+ 2 B+ 4 ) , Pooi(A,B, C ) = (Va+ 2£,4C+7 + Va+ 2B+4,4C+7') ’ 

i =0 2=0 

1 

PlOo(A, B, C) = ^ (V2B+4C+(,A + V r 2B+4C+!,A+2 + V2B+4C+;,A+4 + V2B+4C+i,A+6) > 
i=0 

1 

B011 (A, B, C) = ^(f a .2B+4C+7 + ^A+2,2B+4C+7 + Va+4,2B+4C+ 7 + Va+6,2B+4C+ 7) , 

3=0 

1 

Boio(A, B, C) = ^ (^A+4C,2B+; + Va+ 4C+2,2B+7 + Fa+ 4C,25+4+7 + Kt+4C+2,25+4+7') > 

3=0 

1 

Bi0l(A, B, C) = ^ (V2B+!\A+4C + V’2B+i4+4C+2 + ^25+4+1,A+4C + l / 2«+4+(,A+4C+2) • (30) 


i=0 


This extends the previous graphical expression of 4-by-4 matrix grid for two-particle case with 
8-by-8 matrix grid on which ps are placed with indices i - ,4o + 2Bo+4C'n and j - A\+2B\+AC\. 
Bo 00 and Pm are sums over horizontal and vertical lines, while Pmo, Poio and Pqoi are sums over 
variously shaped ribbons, and Pon, Pioi and Pno their respective mirror images with respect to 
the diagonal lines, all indicated in Fig.3. Comparison between Fig. 1 and Fig. 3 reveals how the 
n — 3 graphs are made out of n — 2 graphs; Pooo, P too and Poio are just the ’’sideway doubling” 
of Poo, Pio and Poi, and the remaining Pm, Pon and Pmi are obtained by mirroring with respect 
to diagonal line that corresponds to the reversing axis indices (a, b, c: 0 1). This construction 

carries over to any n —» n + 1 extension. Drawing various ribbons on the matrix grid as before, 
the following inequality is shown to hold 


P,oo(0,0,0) + Poio(0,0,0) + P 00 i(0,0,0) + P m (l, 1,1) - P 00 o(0,0,0) > 0 (31) 

whose corollary is a three body extension of Hardy equality, which states 

if Pioo(0,0,0) = Poio(0,0,0) = Pooi(0,0,0) = P m (l, 1,1) = 0 hold, 

then Pooo(0,0,0) = 0 follows. (32) 

The geralization of this result 

P ioo...o(0,0,..., 0) + P oio...o(0,0,..., 0) + ... + P oo...oi(0,0,..., 0) 

+P,i...i(l, 1, 1) - Poo...o(0,0,..., 0) > 0, (33) 


and its associated Hardy equality is not hard to prove. Quite separately from above extension, 
we can prove a Zukowski inequality 


Cm - Cooi - Coio - Cioo > -2 


(34) 
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+P in (l,l,l)>P in (0,0,0) 

Figure 4: Graphical depiction of three-body marginal probabilities for the extended Hardy contradiction, represented as 
a line or linked lines for terms in LHS, a dashed box for the RHS, superimposed on the grid representing the underlying 
probabilities Vjj to be summed up. 


in a similar fashion with the ribbons on 8-by-8 grid, whose corollary 

if Coot = Quo = Qoo = 1 then Cm = 1 (35) 

is, of course, the negation of GHZ contradiction. 

Another set of extension exists in the form of two party m -axes Bell experiments. Take for 
example, a system of two spin 1/2 particle each measured by Alice and Bob both of whom now 
has a choice of three projection axes a =0,1,2 and b = 0,1,2. Experimental results are specified 
by 64 marginal probabilities P a b(A, B). The assumption of underlying unconditional probability 
distribution now involves 64 quantities p(Ao, Bq\ A\,Bp, /G, Bij. The proper tabulation of p now 
requires the definition of three two-digits indices 

i k =A k + 2B k , k = 0,2,3, (36) 

thus is a cube grid of size 4-by-4-by-4. As depicted in Fig. 5, P kk is the sum over a slice parallel 
to the cube surfaces, and P k i with k 4 l half sums of two non adjacent surfaces. 

In a similar fashion to the previous cases, although now requiring some 3-dimensional recog¬ 
nition of patterns, we can prove a set of three-axes nonlocality inequalities. One such example is 
depicted in Fig. 6, showing an inequality 

Foo(l, 1) + Pw(0, 0) + P o 2 (0,0) - P n ( 0,0) > 0. (37) 

This is a three-axes version of inequalities of the type d2Tb . In a sense, it can be regarded as 
physically identical to them, since just by renaming the axis ”2” of Bob as ”1”, this simply 
reduces back to one of the two-axis type inequality. But this inequality does involve genuinely 
different three axes. We may obtain a three-axes version of Hardy type equality again by setting 
the fisrt three term of (|371 > to be zero. Instead, we show another interesting face of this inequality 
by it requiring only the first term to be zero. We then have 

if Poo(l,l) = 0, then P 10 (0,0) + Po2(0,0)-Pi 2 (0,0)>0. (38) 

This is nothing but the original Bell inequality in the form devised by J. S. Bell expressed in 
marginal probabilities instead of correlation functions, and the proof shown here is just the graph¬ 
ical dressing of Wigner’s proof. Note that the original requirement of “being in singlet state” is 
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p 20 ( 0,0) p 21 (0,0) p 22 (0,0) 


Figure 5: Graphical depiction of two-body three-axes marginal probabilities P a b(A,B) represented as planes painted in 
identical colors, which cover the three-dimensional grids that represent the underlying probabilities Vu# to be summed 
up. 
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^ 12 ( 0 , 0 ) 


Figure 6: Graphical depiction of two-body three-axes Bell inequality which leads to “Bell’s” Bell inequality. Marginal 
probabilities P a z,(A, B) are represented as identically colored planes for terms in LHS, a transparent square for the RHS, 
which are superimposed on the grid representing the underlying probabilities to be summed up. 


loosened to Poo(l, 1) = 0. We can now see that “Bell’s” Bell inequality occupies a midpoint 
between CHSH type inequality and Hardy type equality. 

6. Discussion 

The derivation of Bell inequalities with the use of underlying probabilities examined here is 
quite general. The argument is very straightforward, and this approach, in principle, is extend¬ 
able to Bell inequalities with four or more choices and also to four or more players. But actual 
graphical representation quickly becomes messy and intractable for higher number of choices 
and players, and therefore, is not expected to be competitive against traditional systematic ap¬ 
proaches |2^]. There is, however, an advantage in our approach of having intuitive graphical 
representation, which should not be missed in pedagogical settings. 

We thank Prof. I. Tsutsui and Dr. T. Ichikawa for stimulating discussions. This research was 
supported by the Japan Ministry of Education, Culture, Sports, Science and Technology under 
the Grant number 15K05216. 
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